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Abstract — In  this  paper,  we  revisit  wavelet  theory  starting  from 
the  representation  of  a  scaling  function  as  the  convolution  of  a 
B-spline  (the  regular  part  of  it)  and  a  distribution  (the  irregular 
or  residual  part).  This  formulation  leads  to  some  new  insights  on 
wavelets  and  makes  it  possible  to  rederive  the  main  results  of  the 
classical  theory — including  some  new  extensions  for  fractional 
orders — in  a  self-contained,  accessible  fashion.  In  particular,  we 
prove  that  the  B-spline  component  is  entirely  responsible  for  five 
key  wavelet  properties:  order  of  approximation,  reproduction 
of  polynomials,  vanishing  moments,  multiscale  differentiation 
property,  and  smoothness  (regularity)  of  the  basis  functions. 
We  also  investigate  the  interaction  of  wavelets  with  differential 
operators  giving  explicit  time  domain  formulas  for  the  fractional 
derivatives  of  the  basis  functions.  This  allows  us  to  specify  a 
corresponding  dual  wavelet  basis  and  helps  us  understand  why 
the  wavelet  transform  provides  a  stable  characterization  of  the 
derivatives  of  a  signal.  Additional  results  include  a  new  peeling 
theory  of  smoothness,  leading  to  the  extended  notion  of  wavelet 
differentiability  in  the  Lp -sense  and  a  sharper  theorem  stating 
that  smoothness  implies  order. 

Index  Terms — Approximation  order,  Besov  spaces,  Holder 
smoothness,  multiscale  differentiation,  splines,  vanishing  mo¬ 
ments,  wavelets. 

I.  Introduction 

THE  mechanics  of  the  wavelet  transform  are  usually  well 
understood  by  engineers  working  in  signal  processing.  In 
essence,  the  system  boils  down  to  a  two  channel  filterbank  as 
shown  in  Fig.  1  [1],  [2].  In  the  decomposition  (or  analysis)  step, 
the  digital  signal  Si(k)  is  split  into  two  half-size  sequences 
Si+i(k)  and  fi,+  i  (kj  by  filtering  it  with  a  conjugate  pair  of 
lowpass  and  highpass  filters  and  Gfz-1),  respec¬ 

tively)  and  down-sampling  the  results  thereafter.  The  signal  is 
then  reconstructed  (synthesis  step)  by  up-sampling,  filtering, 
and  summation  of  the  components,  as  shown  on  the  right-hand 
side  of  the  block  diagram;  note  that  the  analysis  and  synthesis 
procedures  are  flow-graph  transposes  of  each  other.  A  standard 
analysis  shows  that  this  kind  of  two  channel  decomposition 
is  one-to-one  and  reversible  provided  that  the  ^-transforms 
of  the  filters  satisfy  the  perfect  reconstruction  (PR)  equations 
also  given  in  Fig.  1  [1],  [3],  In  the  tree-structured  wavelet 
transform,  the  decomposition  step  is  further  iterated  on  the 
lowpass  component  .sy+i  (k). 

While  the  block  diagram  in  Fig.  1  constitutes  a  valid  descrip¬ 
tion  of  the  algorithm — different  transforms  simply  correspond 
to  different  sets  of  PR  filters — it  tells  us  little  about  wavelets 
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Conditions  for  a  perfect  reconstruction: 

H(z-')H(z)  +  G{z-')G(z)  =  1 
H(z-])H(-z)  +  G(z~')G(-z)  =  0 


Fig.  1.  Block  diagram  of  the  two-channel  filterbank  that  forms  the  basis  for 
the  wavelet  transform  algorithm.  From  a  purely  digital  signal  processing  point 
of  view,  the  only  constraints  on  the  choice  of  the  filters  are  the  two  perfect 
reconstruction  equations. 

per  se  and  their  fundamental  properties.  To  gain  a  deeper 
understanding,  one  needs  to  consider  the  continuous-time  in¬ 
terpretation  of  the  transform  that  is  favored  by  mathematicians 
[4]— [6] .  This  representation  involves  continuously-defined 
basis  functions  (wavelets)  that  are  rescaled  and  shifted  versions 
of  each  other;  it  also  gives  rise  to  a  filterbank  implementation 
that  is  equivalent  to  the  block  diagram  in  Fig.  1 — Mai lat’s 
fast  wavelet  algorithm  [4],  [7],  However,  one  of  the  important 
points  of  the  continuous-time  formulation  is  that  the  filters 
cannot  be  completely  arbitrary.  A  key  constraint  is  that  the 
lowpass  filter  H(z ) — also  called  the  refinement  filter — must 
be  factorizable  as  H(z )  =  2-7  (l  +  £-1)  '  Q(z),  which  is  an 
expression  that  involves  some  number  of  “regularity”  factors 
(1  +  z_1)  as  well  as  a  stable  residual  term  Q(z )  satisfying 
the  lowpass  constraint  Q(  1)  =  1.  A  nontrivial  factor  Q(z)  is 
generally  necessary  for  obtaining  orthogonal  or  biorthogonal 
wavelets  that  can  be  implemented  by  means  of  FIR  filters  [8], 
[9].  The  presence  of  the  regularity  term  (l  +  z-1)  '  is  essential 
for  theoretical  reasons.  It  is  responsible  for  a  number  of  key 
wavelet  properties  such  as  order  of  approximation,  vanishing 
moments,  reproduction  of  polynomials,  and  smoothness  of  the 
basis  functions.  If  one  excepts  the  vanishing  moments,  these 
are  aspects  of  wavelets  that  are  often  not  so  well  understood  by 
signal  processing  practitioneers,  mainly  because  the  connection 
with  the  digital  filterbank  interpretation  is  not  obvious.  This 
is  rather  unfortunate  as  many  consider  these  properties  as  the 
very  core  of  wavelet  theory  (cf.  [6,  Ch.  7]);  they  are  almost 
unavoidable  if  one  wants  to  give  precise  explanations  as  to  why 
wavelets  work  so  well  for  approximating  piecewise-smooth 
signals  and  why  they  are  such  a  nice  tool  for  characterizing 
singularites  [10]. 

The  purpose  of  this  paper  is  to  demystify  some  of  these  con¬ 
cepts  by  linking  them  to  a  factorization  theorem  involving  the 
convolution  between  a  B-spline  (the  regular  part  of  the  wavelet) 
and  a  distribution  (the  irregular  or  residual  part).  Indeed,  we  will 
show  that  it  is  the  B-spline  part — and  nothing  else — that  brings 
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in  all  the  desired  mathematical  properties.  Although  the  effect 
of  the  regularity  factors  is  well  understood  by  mathematicians 
working  with  wavelets,  we  are  not  aware  of  any  deliberate  effort 
to  explain  these  properties  from  the  perspective  of  B-splines. 
While  it  could  be  argued  that  this  is  essentially  a  matter  of  inter¬ 
pretation  (regularity  factors  are  equivalent  to  B-splines),  we  be¬ 
lieve  that  the  present  formulation  makes  the  whole  matter  more 
transparent  and  accessible.  Our  only  prerequisite  here  is  to  have 
a  complete  understanding  of  the  properties  of  B-splines,  which 
is  much  easier  than  for  other  wavelets,  since  these  are  the  only 
ones  that  have  explicit  formulas.  We  then  use  relatively  simple 
manipulations  to  show  that  these  properties  carry  over  to  all 
scaling  functions  through  the  convolution  relation. 

A.  Scope  and  Organization  of  the  Paper 

This  paper  is  written  for  researchers  in  signal  processing  who 
have  a  basic  knowledge  of  wavelets  and  filterbanks  and  who 
would  like  to  improve  their  understanding  of  the  more  theoret¬ 
ical  aspects  of  wavelets  that  are  usually  confined  to  the  math¬ 
ematical  literature.  The  paper  is  largely  self-contained  but  as¬ 
sumes  some  familiarity  with  Mallat’s  multiresolution  formula¬ 
tion  of  the  wavelet  transform  that  rests  on  the  fundamental  no¬ 
tion  of  scaling  function  [4],  From  a  theoretical  point  of  view,  the 
scaling  function  is  more  important  than  the  wavelet  because  it 
provides  the  elementary  building  blocks  for  the  decomposition; 
it  is  responsible  for  the  key  properties  of  the  transform — and  this 
is  precisely  what  this  paper  is  all  about. 

The  presentation  is  organized  as  follows.  In  Section  II,  we 
recall  the  main  definitions  and  mathematical  concepts  that  are 
required  for  the  L-?  formulation  of  the  wavelet  transform.  We 
also  spend  some  time  describing  the  fractional  B-splines,  which 
play  a  central  role  in  our  formulation.  In  Section  III,  we  red¬ 
erive — and  often  extend — the  key  theorems  of  wavelet  theory 
starting  from  the  B-spline  factorization  of  the  scaling  function. 
Finally,  in  Section  IV,  we  investigate  the  smoothness  issue  and 
the  interaction  of  wavelets  with  fractional  differential  operators. 
An  important  aspect  of  our  formulation  is  that  it  allows  for  non¬ 
integer  orders  7,  which  are  not  covered  by  traditional  wavelet 
theory.  This  brings  in  two  novelties; 

1)  the  extension  of  the  theory  for  fractional  wavelets  such  as 
those  introduced  in  [11]; 

2)  a  new  “peeling”  theory  of  smoothness  which  general¬ 
izes  an  interpretation  of  integer  orders  of  differentiability 
given  by  Strang  [6], 

The  motivation  here  is  to  come  up  with  a  more  intuitive — and 
perhaps  even  more  general — understanding  of  the  concept  of 
wavelet  smoothness.  Another  important  goal  of  this  paper  is  to 
clarify  the  notion  that  the  wavelet  representation  is  stable  with 
respect  to  (fractional)  differentiation. 


II.  Scaling  Functions  and  Wavelets  in  L2 

This  section  presents  a  brief  review  of  the  L2  formulation  and 
interpretation  of  the  wavelet  transform.  It  also  contains  a  short 
primer  on  fractional  B-splines. 


A.  Continuous  Function  Spaces  and  Notations 

The  continuous-time  version  of  the  wavelet  transform  applies 
to  functions  f(x),  x  £  R  that  are  square  integrable.  The  space 
of  those  functions  is  denoted  by  L2\  it  is  the  Hilbert  space  that 
corresponds  to  the  inner  product 

/+00 

f(x)g(x)dx  (1) 

■OO 

where  the  integral  is  taken  in  Lebesgue’s  sense.  The  energy  of 
a  function  /( x)  is  given  by  its  squared  L2-norm\  ||/||^o  = 
(/,  f)Lf,  thus,  the  notation  /  £  L2  is  equivalent  to  the  state¬ 
ment  that  | I/ll ^  is  finite.  More  generally,  one  defines  the  Lp 
spaces  for  1  <  p  <  +00  (where  L  stands  for  Lebesgue)  as  the 
set  of  functions  whose  ip-norm 


is  finite.  These  are  Banach  spaces;  there  is  no  corresponding 
inner  product,  except,  of  course,  for  p  =  2. 

The  Fourier  transform  of  f(x)  is  denoted  by  f(uj).  If  /  £  L\, 
it  is  given  by  f(x)e~:>ulxdx.  This  definition  can  be  ex¬ 
tended  for  functions  /  £  Lp,  as  well  as  for  generalized  func¬ 
tions  //  £  S',  where  S'  stands  for  Schwarz’  space  of  tempered 
distributions  on  R  [12], 

In  wavelet  theory,  one  usually  considers  some  wavelet  func¬ 
tion  '(/;(;/;)  that  generates  a  basis  of  L2.  What  is  meant  here  is 
that  every  function  /  £  L2  can  be  represented  in  a  unique  and 
stable  fashion  using  the  expansion 

f(x)  =  ^2^2  di(k)^k(x).  (3) 

ie z  fee z 

where  the  basis  functions  fji.kix)  =  2~l^‘1ip{x/2q'  —  k )  are 
obtained  by  dilation  (index  i)  and  translation  (index  k)  of  the 
wavelet  ip(x).  The  wavelet  coefficients  di(k )  are  obtained  by 
forming  the  (double  infinite)  sequence  of  inner  products 

di(k)  =  (4) 

where  \tpi  k>  is  the  biorthogonal  basis  of 

L  ’  J  ie z,fcez 

iei_,ke%~  such  that  Iff i,k i'll) j.i^  —  ^i—j  *  /■  The 

biorthogonal  basis  is  also  generated  by  a  single  template 
il>(x) — the  so-called  biorthogonal  wavelet  [8].  In  practice,  the 
inner  products  are  never  evaluated  literally  as  in  (4).  They  are 
computed  much  more  efficiently  using  Mallat’s  algorithm, 
which  is  based  on  a  hierarchical  application  of  the  filterbank 
decomposition  in  Fig.  1  [4].  The  wavelet  transform  specifies 
the  choice  of  a  particular  set  of  perfect  reconstruction  filters, 
and  vice  versa  [cf.  (6)  and  (8)]. 

B.  Scaling  Functions 

The  continuous-time  interpretation  of  the  wavelet  decompo¬ 
sition  algorithm  in  Fig.  1  is  based  on  the  fundamental  concept  of 
scaling  function  [4].  The  scaling  function  <p(x)  is  either  given 
explicitly — as  is  the  case  with  the  B-splines — or  it  is  derived 
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(a)  <0®  (*),  0®  (x  -  k))  =  8*  <=> 


^c(k)$°+(x  -  k) 

ksZ 


|2 


2 

c  t  (Parseval) 


(b)  0®U  /  2)  =  P°+«  +  P°+(*  - 1)  (c)  V(A-  /  2)  =  0®  (x)  -  P°+U  - 1) 

4 


4 - 1 - 1 - 1 - 1 - : 

(d)  £p°(*-*)  =  l 

*eZ 

Fig.  2.  Illustration  of  the  key  properties  for  the  Haar  case  (p(x)  =  /^(rc)  (B-spline  of  degree  zero),  (a)  Orthogonality,  (b)  Two-scale  relation,  (c)  Wavelet 
relation,  (d)  Partition  of  unity. 


indirectly  from  the  refinement  filter  H(z)  as  solution  of  the 
two-scale  relation  [9].  The  difficulty  with  the  latter  approach  is 
that  not  all  filters  do  generate  valid  scaling  functions  [13],  [14]. 
The  mathematical  requirements  are  the  following. 

Definition:  tp(x)  is  an  admissible  scaling  function  of  L2  if 
and  only  if  it  satisfies  the  following  three  conditions: 

i)  Riesz  basis  criterion;  there  exist  two  constants  A  >  0  and 
B  <  +00  such  that 

A  <  +  27rn)|2  <  B  (5) 

n£  z 

ii)  Refutability 

<p(|)  =2^2h(k)<p(x-k)  (6) 

fee  z 

iii)  Partition  of  unity 

E  V(x  -  k)  =  1-  W 

kez 

These  three  conditions  are  satisfied  by  the  Haar  function 
0\.{x) — or  B-spline  of  degree  zero — as  illustrated  in  Fig.  2. 


Condition  i)  ensures  that  ip  generates  a  stable  basis  for  the  basic 
function  space 


V(0)  =  <  s(x)  =  V'  c(k)tp(x  -k),  c<El2 


kez 


In  other  words,  there  is  an  equivalence  between  the  i^-norm  of 
the  functions  in  Vjp)  and  the  1 2 -norm  of  their  coefficients 


Vc  £  I2,  A  -  llcll2  < 
l2 


J2c(k)ip(x  -  k) 
k 


2 

L2 


where  A  and  B  are  the  Riesz  bounds  of  ip.  The  basis  is  or¬ 
thonormal  if  and  only  if  A  —  B  =  1,  as  is  precisely  the  case 
with  the  example  in  Fig.  2(a). 

The  two-scale  relation  (6),  which  is  equivalent  to  ip(x/ 2)  £ 
V(q),  is  the  key  to  the  multiresolution  structure  of  the  transform. 
It  allows  us  to  define  the  coarse-to-fine  sequence  of  embedded 
subspaces  {0}  C  •  •  •  C  %)  C  •  •  •  C  V(i)  C  V(0)  C  V(_i)  C 
■■■  Cl2  such  that  ffix)  £  V(i)  <S>  ffi 2lx)  £  F(0). 

The  last,  more  technical  partition  of  unity  condition  is  neces¬ 
sary  and  sufficient  (under  mild  conditions)  for  the  approxima¬ 
tion  error  to  vanish  as  the  wavelet  size  approaches  zero  [15].  It 
ensures  that  the  multiresolution  decomposition  is  dense  in  L2. 

We  can  prove  that  the  above  specification  of  a  valid  scaling 
function  is  equivalent  to  the  axiomatic  definition  of  a  multires- 
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olution  analysis  of  fx>  given  by  Mallat  [7].  Hence,  we  have  the 
guarantee  that  there  exists  some  corresponding  wavelet  ip(x) 
that  generates  a  Riesz  basis  of  Lo-  The  wavelet  is  expressed  as 
a  linear  combination  of  shifted  versions  of  the  scaling  function: 

V’(l)  =  2^2  g(k)p(x  -  k)  (8) 

fee  z 

where  the  weights  g(k)  correspond  to  the  impulse  response  of 
the  wavelet  synthesis  filter  in  Fig.  1.  The  dual  scaling  function 
0  and  wavelet  0  are  defined  in  a  similar  fashion  using  the  dual 
versions  of  (6)  and  (8),  respectively;  these  involve  the  analysis 
filters  h  and  g  in  Fig.  1 . 


C.  Fractional  B-Splines 

The  simplest  example  of  a  scaling  function  is  the  B-spline  of 
degree  0  fcf.  Fig.  2).  This  function  can  be  constructed  by  taking 
the  difference  of  two  step  functions  shifted  by  one.  This  yields 
the  following  mathematical  representation: 

/3+(x)  :=  x+  —  (x  —  1)°  =  A+;c+ 


where  x"  denotes  the  one-sided  power  function  = 

max{0,a;}Q;  A+  is  the  causal1  finite  difference  (convolution) 
operator  whose  impulse  response  is  S(x )  —  S(x  —  1).  By 
convolving  /39  with  itself  n  times,  one  generates  the  classical 
B-splines  which  are  piecewise  polynomials  of  degree  n[16]. 
These  are  also  valid  scaling  functions  simply  because  the  con¬ 
volution  of  two  scaling  functions  is  a  scaling  function  as  well.  It 
is  still  possible  to  go  one  step  further  by  considering  fractional 
convolution  products  that  yield  the  fractional  B-splines  [11]: 


&{x) 


A"+1a;“ 

T(a  +  1) 


^l-e-^y+1 


(10) 


The  right-hand  side  of  this  formula  is  easy  to  understand:  It  is 
the  (a  +  l)th  power  of  the  Fourier  transform  in  (9).  The  corre¬ 
sponding  definition  of  the  fractional  power  of  a  complex  number 
z  is  za  =  \z\  eJQarg(z)  with  arg (z)  £  [— 7r,  7 r[.  The  time-do- 
main  formula  on  the  left-hand  side  can  be  obtained  by  inverse 
Fourier  transformation  (cf.  [11]).  The  key  operator  here  is  the 
fractional  causal  finite  difference  A“+1,  which  is  best  defined 
in  the  Fourier  domain 


Aa++1f(x)J^(l-e-^)a+1  (ID 


To  obtain  its  time  domain  representation,  we  expand 
(l  —  e_-JW)a+1  using  Newton’s  generalized  binomial  the¬ 
orem  and  perform  an  inverse  Fourier  transform 

+oo 

A %+1f(x)  =  aa+1(k)f(x  -  k ) 
k= 0 

k  (  a  +  1 

V  * 

’the  subscript  “+”  is  used  to  signify  that  the  corresponding  function  or  con¬ 
volution  operator  is  causal;  this  is  to  distinguish  them  from  the  centered  version 
of  these  operators  which  are  often  used  in  spline  theory. 


with  aa+1{k)  =(-l) 


Fig.  3.  Fractional  B-splines  for  a  >  0.  The  polynomial  ones  (a 

integer)  are  represented  using  thicker  lines. 


where  ^  ^  ^  =  T(m  +1)/ (T(t;  +  l)r(ti  —  v  +  1));  T(m+  1)  is 

Euler’s  Gamma  function,  which  interpolates  the  factorial,  i.e., 
T(k  +  1)  =  k\  for  k  integer.  If  we  specialize  these  formulae 
for  the  more  standard  integer  case  ( a  =  n ),  then  (10)  yields  the 
classical  finite  difference  formula  for  the  polynomial  B-splines 
(cf.  [17]). 

The  fractional  B-splines  are  shown  in  Fig.  3;  they  provide  a 
progressive  transition  between  the  polynomial  ones  (a  integer) 
displayed  in  thicker  lines.  These  functions  have  a  number  of  in¬ 
teresting  properties  that  are  briefly  summarized  here — for  more 
details,  see  [11], 


•  They  generate  valid  Riesz  bases  for  a  >  —1/2.  In  par¬ 
ticular,  this  means  that  they  are  square  integrable,  i.e., 
j0?  G  Li,  Vet  >  -1/2. 

•  They  belong  to  Lp  for  all  yv  >  1  whenever  a  >  0,  and  for 
all  1  <  p  <  l/-a  if  a  <  0. 

•  They  satisfy  the  convolution  relation:  /3"1  *  /3"2  = 

^oi+a2+i  This  comes  as  a  direct  consequence  of  their 
definition. 

•  They  reproduce  polynomials  of  degree  n  =  fa],  where 
fa]  denotes  the  ceiling  of  a.  In  particular,  they  satisfy  the 
partition  of  unity  for  a  >  —  1. 

•  They  satisfy  the  two-scale  relation.  Their  refinement  filter 
is  the  generalized  binomial.  This  is  established  easily  by 
expressing  the  two-scale  relation  in  the  Fourier  domain 
and  evaluating  the  following  ratio: 


7T“(0  = 


= 

l+e~ju 


1 
2 
Cfc+1 


-2  jw 


1  -  e 


a-\-l 


(13) 


In  fact,  the  fractional  B-splines  share  all  the  properties  of  the 
polynomial  ones  (cf.  [18])  with  two  notable  exceptions:  They 
are  not  strictly  positive  nor  compactly  supported  unless  a  is  in¬ 
teger.  For  a  noninteger,  they  decay  like  x~a~ 2  when  x  — >  +oo. 
Their  lack  of  compact  support  is  also  apparent  if  we  look  at  their 

refinement  filter  =  2_(“+1^  fa  jjf  ’  V  which  is  infin¬ 


itely  supported  whenever  a  is  noninteger,  although  it  decays  like 

k~ia +2/  }•  4-00. 
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D.  Differentiation  and  Smoothness 

Our  primary  motivation  for  considering  fractional  splines  in¬ 
stead  of  conventional  ones  is  that  the  enlarged  family  happens 
to  be  closed  under  fractional  differentiation.  This  will  prove  ex¬ 
tremely  useful  for  understanding  and  characterizing  the  smooth¬ 
ness  properties  of  scaling  functions  and  wavelets. 

Our  fractional  derivative  operator  d 7  is  defined  in  the  Fourier 
domain 

ar'f(x)J^(jLjy'f(w).  d4) 

It  coincides  with  the  usual  derivative  operator  when  7  is  integer. 
Applying  the  operator  to  the  fractional  B-splines  yields  the  fol¬ 
lowing  explicit  differentiation  formula 

87P%{x)  =  tf+0$T{x)  (15) 

where  Al  is  the  fractional  finite  difference  operator  defined 
by  (12).  Thus,  the  fractional  derivative  of  a  B-splines  happens 
to  be  spline  with  a  corresponding  decrease  of  the  degree.  The 
derivation  of  this  result  is  rather  straightforward  if  we  work  in 
the  Fourier  domain 

/ 1  -  e~ju 

an^{x)^{juty  •  — 

=  (1- 

Again,  this  is  an  extension  of  the  well-known  differentiation 
formula  for  the  polynomial  B-splines. 

Thus,  if  we  differentiate  fjf  (x)  a  times,  we  end  up  with 
a  function  da/3f_  =  A“/3°(tc)  that  is  piecewise  constant  and 
bounded  (because  aQ(k )  £  / 1  for  a  >  —1).  The  boundedness 
of  the  derivative  is  an  indication  that  iff  is  a-Holder  continuous. 
In  fact,  /?“( x)  is  infinitely  differentiable  everywhere  except  at 
the  knots  (integers),  where  it  has  discontinuities  of  order  a  (e.g., 
(x  —  k)f).  Holder  smoothness  is  a  pointwise  measure  of  conti¬ 
nuity  that  is  often  used  to  characterize  wavelets  [19].  In  this  con¬ 
text,  the  Sobolev  smoothness,  which  is  a  more  global  measure 
of  differentiability  in  the  i2_sense-  is  also  important  [6],  [14], 
[20],  The  critical  Sobolev  exponent  for  the  fractional  B-splines 
is  A' ma,x  =  ol  +  1/2.  In  other  words,  f3f(x )  has  smax  derivatives 
in  the  Z2-sense,  i.e.,  ||3s/3"  ||Li  <  +oo  for  0  <  s  <  a  +  1/2. 

III.  Wavelet  Theory  Revisited 

Our  goal  in  this  section  is  to  reformulate  wavelet  theory  using 
a  nontraditional  point  of  view.  Our  argumentation  is  entirely 
based  on  a  B-spline  factorization  theorem  (Section  III-B),  which 
is  intimately  related  to  the  crucial  notion  of  approximation  order. 
We  will  use  this  representation  to  derive  the  most  important 
wavelet  properties  using  relatively  straightforward  manipula¬ 
tions.  In  other  words,  we  will  show  that  the  order  of  approxima¬ 
tion  of  the  transform  (Section  III-A),  the  reproduction  of  poly¬ 
nomials  property  (Section  III-C),  the  vanishing  moments  of  the 
wavelet  (Section  III-D),  and  the  multiscale  differentiation  prop¬ 
erty  of  the  wavelet  (Section  III-E)  can  all  be  attributed  to  the 
B-spline  that  lies  hidden  within. 


A.  Order  of  Approximation 

A  fundamental  idea  in  wavelet  theory  is  the  concept  of  mul¬ 
tiresolution  analysis.  There,  one  usually  considers  a  sequence 
of  embedded  subspaces  with  a  dyadic  scale  progression,  i.e., 
a  =  2 l,  i  £  Z.  Specifically,  the  approximation  space  at  scale 
a  =  2l  (or  at  resolution  level  i)  is  defined  as 

Va=\  Sa(x)  =  ^  C(k)V  ~  k)  :  C(k )  e  \  ■  (16) 

l  fee  z  a  J 

Because  of  the  two-scale  relation,  the  subspaces  have  the  fol¬ 
lowing  inclusion  property:  V2a  C  Va.  Given  some  input  func¬ 
tion  /  £  Lo,  one  considers  its  approximation  in  Va.  The  best 
approximation  in  the  least-squares  sense  (minimum  7,2 -norm) 
is  given  by  the  orthogonal  projection  into  Va 

V/  £  L2l  Paf  =  arg  min  ||/  -  sa\\Lo  £  Va.  (17) 

sa€Va 

Because  of  the  Riesz  basis  property,  the  approximation  Paf  of 
/  in  Va  is  always  well-defined  and  unique  (Projection  theorem). 
A  necessary  condition  for  being  able  to  construct  wavelet  bases 
of  L2  is  that  the  approximation  improves  as  the  scale  a  gets 
closer  to  zero.  In  other  words,  we  want  to  be  able  to  approxi¬ 
mate  any  /^-function  as  closely  as  desired  by  selecting  a  scale 
(or  sampling  step)  that  is  sufficiently  small.  It  is  therefore  crucial 
to  understand  and  characterize  the  rate  of  decay  of  the  approxi¬ 
mation  error  as  a  function  of  the  scale.  This  leads  to  the  notion 
of  order  of  approximation,  which  plays  a  fundamental  role  in 
wavelet  theory  [6],  [21],  [22], 

Definition:  A  scaling  function  ip  has  order  of  approximation 
7  if  and  only  if 

V/  G  W7,  ||/  -  Paf\\L2  <  C  ■  a7  \\d7f\\L2  (18) 

where  C  is  a  constant  that  may  depend  on  ip  but  not  on  /;  the 
condition  /  £  Wf  (Sobolev  space  of  order  7)  ensures  that 
\\dnf\\Lo,  which  is  the  L2-t\om\  of  the  7th  derivative  of  /,  is 
finite  so  that  the  bound  is  well  defined. 

This  definition  means  that  for  smooth  signals,  the  /^-approx¬ 
imation  error  should  decay  like  the  7th  power  of  the  scale.  All 
popular  families  (Daubechies,  Battle-Lemarie,  etc.)  satisfy  the 
order  constraint  (29)  and  are  indexed  by  their  order  7,  which  is 
always  a  positive  integer.  The  least  sophisticated  transform — 
Haar  with  <p(x)  =  0\.{x) — has  a  first  order  of  approximation 
(7  =  1),  which  is  enough  for  the  approximation  error  to  vanish 
as  a  —a  0.  However,  one  usually  prefers  to  work  with  higher 
order  approximations  which  result  in  much  faster  convergence; 
typically,  7  =  4  (e.g.,  the  9/7  Daubechies  and  cubic  spline 
wavelets).  While  traditional  wavelet  theory  requires  7  to  be  in¬ 
teger,  our  definition  is  also  valid  for  noninteger  orders  7  £  R+ 
and  applicable  to  the  recent  fractional  extensions  of  wavelets 
[11]. 

The  way  in  which  the  order  of  approximation  translates  into 
filter  design  constraints  is  made  explicit  by  the  following  the¬ 
orem. 
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(a) 


(b) 


(c)  (d) 

Fig.  4.  Plot  of  Y+Y  '1n  /,  17  ( -r  _  /•')  for  various  values  of  a.  (a)  a  =  0  (piecewise  constant),  (b)  a  =  0.5  (fractional  spline),  (c)  a  =  1  (linear  spline),  (d) 
o  =  3  (cubic  spline).  This  illustrates  the  property  that  the  B-splines  are  able  to  reproduce  a  first  order  polynomial  as  soon  as  o  >  0  (i.e.,  cases  b,  c,  and  d,  but  not 
a). 


Theorem  1:  A  valid  scaling  function  ip(x )  has  a  7th-order  of 
approximation  if  and  only  if  its  refinement  filter  H(z)  can  be 
factorized  as 

H(z)  =  (19) 

V*  v*  ^  distributional  part 
spline  part 

where  Q(z )  is  stable,  i.e.,  |Q(e-7t''’)|  <  C. 

The  restricted  version  of  this  theorem  for  7  compactly  sup¬ 
ported — or  equivalently  H(z)  FIR — with  7  integer  is  a  standard 
result  in  wavelet  theory  (cf.  [6]).  The  important  point  here  is  that 
the  present  version  holds  for  any  real  7  >  0  with  minimal  re¬ 
striction  on  7  (Riesz  basis).  The  proof  is  technical  and  will  be 
published  elsewhere  [23], 

B.  B-Spline  Factorization  Theorem 

The  idea  here  is  to  interpret  Theorem  1  in  terms  of  B-splines. 
If  we  were  to  consider  the  first  term  of  the  factorization  alone, 
then  the  refinement  filter  would  generate  a  fractional  B-spline 
of  degree  a  =  7  —  I  [cf.  (13)].  The  Q(z)  filter  alone,  on  the 
other  hand,  would  generate  some  distribution  77  £  S',  which 
is  typically  not  a  true  function  (in  fact,  we  will  see  later  on  that 
it  cannot  have  any  Sobolev  regularity  at  all).  The  only  thing  we 
can  say  about  70  is  that  it  is  lowpass  (because  Q(  1)  =  1).  Thus, 
because  of  the  convolutional  nature  of  the  two-scale  relation,  we 
obtain  the  following  corollary,  which  constitutes  the  main  result 
on  which  we  will  build  our  wavelet  theory. 

Theorem  2  (B-Spline  Factorization):  7  is  a  valid  scaling 
function  of  order  7  if  and  only  if  its  Fourier  transform  7  can 
be  factorized  as 

7>M  =  /3+-1(w)  •  70M 


where  fk^~]  (lo)  is  the  Fourier  transform  of  a  fractional  B-spline 
as  given  by  (10),  and  (po(uj)  is  a  true  function  of  u  bounded 
on  every  closed  interval.  Because  of  our  assumptions  on  7,  this 
corresponds  to  a  well-defined  convolution  product  in  the  time 
domain 


It  is  therefore  always  possible  to  express  a  scaling  function  as  the 
convolution  between  a  B-spline  and  a  distribution.  What  The¬ 
orem  2  also  tells  us  is  that  the  B-spline  part  is  entirely  respon¬ 
sible  for  the  approximation  order  of  the  transform.  We  will  now 
use  the  convolution  relation  (20)  to  show  that  the  B-spline  part 
brings  in  three  other  very  useful  properties. 

C.  Reproduction  of  Polynomials 

We  have  already  mentioned  that  the  B-splines  of  degree  a 
have  the  ability  to  reproduce  polynomials  of  degree  lesser  or 
equal  to  n  =  [a].  Practically,  this  means  that  we  can  gen¬ 
erate  all  polynomials  by  taking  suitable  linear  combination  of 
B-splines.  In  particular,  one  can  construct  the  following  series 
of  polynomials  for  p  =  0, . . . ,  n  =  [ct] : 

y  kp) 3+(x  -  k)  =  apfiXp  + - b  with  ap%0  =  1  (21) 

fee  z 

which  form  a  basis  of  the  polynomials  of  degree  n  because 
ap  (l  f  0.  The  ability  of  B-splines  to  reproduce  a  straight  line 
(p  =  1)  is  illustrated  in  Fig.  4. 

The  fact  that  (21)  holds  for  the  conventional  case  a  =  n, 
with  n  integer,  is  not  too  surprising  because  the  B-splines  are 
themselves  piecewise  polynomial  of  degree  n.  The  noninteger 
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case  [cf.  Fig.  4(b)]  is  less  intuitive  but  is  only  truly  relevant  for 
fractional  wavelets,  which  exhibit  a  few  exotic  properties  [11], 

The  main  point  of  this  section  is  that  the  polynomial  repro¬ 
duction  property  is  preserved  through  the  convolution  relation 
(20). 

Proposition  3:  Let  ipo  be  any  distribution  such  that 
f  ipo(x)dx  =  1  and  J  xz(po(x)dx  <  +oo,  i  =  1, . . .  ,n.  Then, 
c p(x )  =  3"  *  ifio(x)  reproduces  the  polynomials  of  degree 
lesser  or  equal  to  n  =  [a] . 

When  7  is  integer.  Proposition  3  is  a  B-spline  reformulation 
of  a  standard  result  in  wavelet  theory  [5],  [6].  What  is  novel  here 
is  the  extension  for  7  =  a  +  1  noninteger,  which  is  nontrivial. 

Proof:  We  assume  that  the  residual  filter  q  has  sufficient 
(inverse  polynomial)  decay  for  the  moments  of  ipo  to  be 
bounded  up  to  order  n.  In  other  words,  we  want  ipo( w)  to  be  n 
times  differentiable  at  the  origin.  This  mild  technical  require¬ 
ment — which  is  much  weaker  than  compact  support — ensures 
that  the  convolutions  and  manipulations  below  are  well-defined 
in  the  distributional  sense. 

We  start  by  showing  that  the  convolution  between  (po  and  the 
monomial  xv  produces  a  polynomial  qp(x )  of  degree  p 


=xp  +  hxp  1  -I - 1-  bp  =  qp{x) 

with  bk  =  (-l)k  (j^  m^0. 


The  leading  coefficient  is  one  because  Q(  1)  =  1  7  m®0  = 
J  (po(x)dx  =  </?o(0)  =  1-  We  then  use  this  result  to  evaluate  the 
sum 

kp(p(x  —  k )  =  kp  (tp 0  *  /3“)  (x  —  k ) 
fcez  ke 1 

v 

=Po  *  ap,kxp-k  [using  (20)  and  (21)] 
k= 0 

p 

—  ^  —  tcp  T  C\XP  -(-■*■  ttp. 

k= 0 

In  addition,  here,  the  leading  coefficient  is  one  implying  that  the 
sequence  of  these  polynomials  for  p  =  0, ....  n  is  a  basis  for 
the  polynomials  of  degree  n.  □ 

This  polynomial  reproduction  property  is  illustrated  in  Fig.  5 
for  the  Daubechies  scaling  function  of  order  2.  Although  no  one 
will  question  the  fact  that  the  linear  B-splines  reproduce  the  con¬ 
stant  and  the  ramp,  it  is  much  less  obvious  that  the  fractal-like 
Daubechies  functions  are  endowed  with  the  same  property. 

For  sake  of  completeness,  we  mention  the  existence  of  a  con¬ 
verse  implication  which  goes  back  to  the  Strang-Fix  theory  of 
approximation. 

Theorem  4:  If  a  function  p(x)  reproduces  the  polynomials  of 
degree  n  and  is  compactly  supported,  then  p(x)  =  (3 f*  <po(x). 

This  result  was  first  conjectured  by  Strang  in  [24]  and  was 
later  proven  by  Ron  in  a  more  abstract  framework  [25].  A  recent 
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Fig.  5.  Illustration  of  the  reproduction  of  polynomials  of  degree  0  and  1  using 
Daubechies  scaling  functions  of  order  6  =  2 . 

proof  that  is  more  accessible  to  an  engineering  audience  is  avail¬ 
able  in  [26],  Note  that  Theorem  4 — the  converse  part  of  Propo¬ 
sition  3 — is  only  applicable  in  the  conventional  framework:  <p 
compactly  supported,  and  n  =  7  —  1.  We  have  already  encoun¬ 
tered  some  counter  examples  with  the  fractional  B-splines  (i.e., 
n  =  [7  —  1]  f  7  —  1),  which  are  not  compactly  supported. 

In  addition,  note  that  the  results  in  this  section  do  not  require 
the  two-scale  relation.  As  such,  they  are  also  applicable  outside 
the  wavelet  framework — for  instance,  in  the  context  of  interpo¬ 
lation  [27], 

D.  Vanishing  Moments 

A  wavelet  transform  is  said  to  have  n  +  1  vanishing  mo¬ 
ments  if  the  analysis  wavelet  ip  is  such  that  J  xpip(x)dx  =  0, 
p  =  0, . . . ,  n.  These  vanishing  moments  are  undoubtedly  one  of 
the  better  known  and  most  useful  properties  of  wavelets  [10].  In 
particular,  they  give  the  transform  the  ability  to  kill  all  polyno¬ 
mials  of  degree  n  [5],  [21],  This  property  translates  into  a  sparse 
representation  of  piecewise  smooth  signals  because  the  wavelet 
coefficients  will  be  essentially  zero  over  all  regions  where  the 
signal  is  well  approximated  by  a  polynomial,  e.g.,  the  first  few 
terms  of  its  Taylor  series.  This  will  produce  streams  of  zero  co¬ 
efficients  that  can  be  coded  with  very  few  bits  [28],  [29].  The 
vanishing  moments  also  allow  the  characterization  of  singular¬ 
ities  based  on  the  decay  of  the  coefficients  across  scale — the 
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so-called  persistence  across  scale  property  of  the  wavelet  trans¬ 
form  [10],  [30],  [31].  The  study  of  this  decay  plays  an  important 
role  in  the  analysis  of  fractal  and  multifractal  signals  [32], 

The  vanishing  moments  are  nothing  but  an  indirect  manifes¬ 
tation  of  the  ability  of  the  scaling  function  to  reproduce  polyno¬ 
mials. 

Proposition  5:  If  the  scaling  function  <p(x)  reproduces  the 
polynomials  of  degree  n ,  then  the  analysis  wavelet  ip  has  n  +  1 
vanishing  moments. 

This  is  a  standard  result  in  wavelet  theory  (cf.  [5],  [6])  that  is 
rederived  here  for  sake  of  completeness. 

Proof:  The  polynomial  reproduction  property  is  equiv¬ 
alent  to  saying  that  the  polynomials  are  in  the  linear  span 
of  {ip(x  —  fe)}fceZ.  Since  the  wavelet  ip  is  perpendicular  to 
{ip(x  —  k)}keZ  by  construction  fcf.  the  general  biorthogonal 
wavelet  formulation  of  Cohen-Feauveau-Daubechies  [8]),  it  is 
therefore  perpendicular  to  the  polynomials;  in  particular,  the 
monomials  xp  with  p  =  0, . . . ,  n.  □ 

Thus,  by  combining  Propositions  3  and  5,  we  can  claim  that 
it  is  the  B-spline  component  once  again  that  is  entirely  respon¬ 
sible  for  the  vanishing  moments  of  ip.  The  above  argument  can 
also  be  applied  to  the  synthesis  side  of  the  transform.  In  other 
words,  if  the  dual  scaling  function  can  be  factorized  as  p(x)  = 
/37_1  *  (pQ  ( x ) — meaning  that  p  is  of  order  7 — then  the  synthesis 
wavelet  p  will  have  exactly  n  +  1  =  [7]  vanishing  moments 
(general  biorthogonal  case).  When  the  analysis  and  synthesis 
spaces  are  identical  (orthogonal  and  semi-orthogonal  cases),  the 
number  of  vanishing  moments  are  the  same  on  both  sides. 


E.  Multiscale  Differentiator 

We  will  now  show  that  another  consequence  of  the  B-spline 
factor  is  that  the  analysis  wavelet  ip  essentially  behaves  like 
a  7th-order  differentiator.  The  proof  of  this  property  rests  al¬ 
most  entirely  on  the  perfect  reconstruction  property  of  the  fil- 
terbank  in  Fig.  1.  In  the  sequel,  we  will  assume  that  the  four 
filters  H(z~1),  H(z),  G( z -1),  and  G(z )  are  stable  in  the  sense 
that  their  Fourier  transforms  are  bounded.  We  also  recall  that  the 
perfect  reconstruction  property  has  an  equivalent  I2  -formulation 
that  is  expressed  by  four  biorthogonality  relations  between  the 
various  filters  pairs,  which  can  be  written  in  matrix  form,  cf. 
[33] 


/GO*-1)  G(-z~l)\T  (  G(z)  G(-z)\_ 
\Hiz-1)  H(-z^)J  \H(z)  H(—z)  J 

S - V - 'v - V - ' 

M(z-1)T  M(z) 


(22) 


Proof:  Let  us  denote  by  l)(z)  and  D(z)  the  determinant 
of  the  matrices  M(z)  and  M(z)  in  (22),  respectively.  Then, 
clearly,  D(z~1)D(z)  =  1.  By  inverting  M/g-1)7/  we  get 

G(z)  =  HD{z-P  =  ^  <24) 

As  a  result,  imposing  (19)  yields  (23),  with  P(z )  = 

2  rQ(— z~1)D(z),  which  is  bounded  on  the  unit  circle  by  the 
stability  hypothesis.  Conversely,  since  H(z)  =  G(—z~1)D(z), 
imposing  (24)  with  the  assumption  that  P(z)  is  stable  implies 
(19)  with  Q(z)  =  2 1  P(—z~l)D(z),  which  is  stable  as  well. 
Finally,  because  2/it  <  |(1  —  e~^w)/jut |  <  1  fora;  G  [— 7r,7r], 
we  note  that  the  factorization  (23)  with  P(z )  stable  is  equivalent 
to  the  condition  G(eJU’)  <  C  ■  |u;|7  =  0(|u;|7).  □ 

To  get  a  better  feel  for  this  result,  we  consider  (23)  and  take 
the  limit  to  obtain  the  asymptotic  form  of  the  filter  as  u>  — >  0. 
Using  the  fact  that  1  —  e 4“  =  —jut  +  0(cu2)  and  assuming  that 
P(e-JW)  is  continuous,  we  obtain 

G(ejuJ)  =  B0(-ju>)\  as  w  — >  0  (25) 

where  P0  =  P(l)  =  —  2~'1Q(— 1)/G(— 1)  f  0  because 
G(— 1)  <  00  (stability)  and  Q{—  1)  -f  0  (<p  is  of  maximum 
order  7). 

The  transfer  function  of  the  analysis  wavelet  is  obtained  by 
taking  the  Fourier  transform  of  (8)  with  g  =  g  and  ip  =  ip: 

l(w)  =  G(ej“/2)  ■£(!)•  (26) 

Its  low-frequency  response  obviously  depends  on  the  behavior 
of  G{e>u)  near  the  origin. 

Theorem  7:  Let  ip  and  p  be  two  valid  biorthogonal  scaling 
functions  with  p(uj)  and  p(u>)  continuous  at  to  =  0.  Then,  ip  is 
of  order  7  (i.e.,  tp  =  /3+_1  *  ipf)  if  and  only  if 

=  o(M7). 


The  proof  is  given  in  the  Appendix.  If  we  now  assume  that 
P(eJU’)  is  continuous  as  well,  we  can  easily  obtain  the  asymp¬ 
totic  version  of  the  result  by  plugging  (25)  into  (26)  and  making 
use  of  the  property  limw^o  P(w)  =  1-  This  yields 


ip{ijj)  =G7  ■  (—jut)' .  as  hj  — ¥  0 

with  C7  =2_7P(1)  =  — 2“27^— ^  0. 

G(-l) 


(27) 


The  practical  implication  of  Theorem  7  is  that  the  wavelet 
transform  acts  as  a  multiscale  differentiator.  Specifically,  the 
wavelet  coefficients  will  correspond  to  the  samples  of  the  7th 
derivative  of  a  smoothed  version  of  the  input  signal  f(x): 


These  are  strong  constraints  that  have  a  direct  implication  on  the 
form  of  the  wavelet  filter  G(z). 

Proposition  6:  Under  the  constraint  of  a  stable  perfect  recon¬ 
struction  filterbank,  the  order  condition  (19)  is  equivalent  to  the 
following  factorization  of  the  wavelet  analysis  filter: 

G(z)  =  (1  —  zf1  ■  P(z)  (23) 

where  the  filter  P(z)  is  stable,  i.e.,  |P(eJUJ)|  <  G.  Another 
equivalent  formulation  is  G(eJW)  =  O  (|w|7). 


(f(x),ip(x  -  (/))  =  <97  {<p  *  /}  (u) 

where  the  smoothing  kernel  is  defined  by  its  frequency  response 
<p(w)  =  ip*(uj) /(jw)7;  it  is  necessarily  lowpass  and  bounded 
because  of  (27)  and  Theorem  7.  This  kernel  essentially  limits  the 
bandwidth  of  the  signal  being  analyzed  with  two  practical  ben¬ 
efits.  First,  it  regularizes  the  differentiation  process  by  reducing 
its  noise  amplification  effect,  and  second,  it  attenuates  the  signal 
components  above  the  Nyquist  frequency  so  that  the  differenti¬ 
ated  signal  is  well  represented  by  its  samples  (or  wavelet  coef¬ 
ficients). 
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In  the  classical  case  where  7  =  L  is  an  integer,  there  is  an 
equivalence  between  (27)  and  the  vanishing  moment  property. 
This  can  be  shown  by  determining  the  Taylor  series  of 

around  uj  =  0:  V;(w)  =  Cl  ■  (—jv)L  +  0(wi+1),  where  Cl  = 
I  xLip(x)dx/ L\  is  proportional  to  the  magnitude  of  the  first 
nonvanishing  moment. 

In  the  fractional  case,  there  is  no  Taylor  series  of  order  L  and, 
therefore,  no  direct  equivalence  with  the  vanishing  moments 
property.  The  unique  property  of  this  kind  of  wavelets  is  that 
they  give  access  to  fractional  orders  of  differentiation,  which 
can  be  very  useful  in  some  applications  [34],  in  particular,  when 
dealing  with  fractal-like  signals. 

IV.  Wavelet  Differentiability  and  Integrability 

One  of  the  primary  reasons  for  the  mathematical  successes 
of  wavelets  is  their  stability  with  respect  to  differentiation  and 
integration.  In  the  conclusion  of  his  classical  monograph,  Meyer 
writes:  “everything  takes  place  as  if  the  wavelets  ip(x/a)  were 
eigenvectors  of  the  differential  operator  </' ,  with  corresponding 
eigenvalue  a~a”  [35],  We  will  now  use  our  B-spline  formalism 
to  shed  some  light  on  this  important  aspect  of  wavelet  theory. 

As  starting  point,  we  use  the  B-spline  factorization 
(20)  together  with  the  B-spline  convolution  property 
(/3+-1  =  /3+_1_s  *  /3+-1)  to  manipulate  the  basic  scaling 
function  biorthogonality  relation  as  follows: 

(ip(x),  tp(x  -  k))  =((/31-1  *  p)(x),  (/3^_1_s  *  <p0)(x  -  k)) 


where  j3®(x)  =  /?“(— x)  denotes  the  anticausal  B-spline  of  de¬ 
gree  a.  This  formula  suggests  that  the  new  pair  of  functions 
(p7+s,  <Pj-s)  should  also  generate  a  valid  biorthogonal  basis.  In 
the  sequel,  we  will  show  that  these  scaling  functions  play  a  cru¬ 
cial  theoretical  role  for  they  provide  the  building  blocks  for  the 
fractional  integrals  and  fractional  derivatives  of  the  analysis  and 
synthesis  wavelets,  respectively.  For  this  purpose,  we  will  first 
investigate  the  extent  to  which  p  (or  equivalently  ip)  is  differ¬ 
entiable  and  propose  a  peeling  formulation  of  smoothness  that 
provides  some  new  insights  on  the  various  notions  of  wavelet 
regularity.  We  will  also  prove  that  the  B-spline  component  is  en¬ 
tirely  responsible  for  the  smoothness  of  the  basis  functions.  Fi¬ 
nally,  in  Section  IV-C,  we  will  specify  the  biorthogonal  wavelet 
basis  that  is  associated  with  the  fractional  differentiation  oper¬ 
ator. 

A.  Smoothness:  Peeling  Theory 

In  this  subsection,  we  characterize  the  fractional  derivatives 
of  the  scaling  function  and  show  that  the  presence  of  the 
B-spline  component  is  absolutely  necessary  for  these  to  be 
well-defined.  The  argument  is  entirely  based  on  the  convolu¬ 
tion  property  of  B-splines  (cf.  Section  II-C).  Specifically,  we 
rewrite  the  B-spline  factorization  (20)  in  terms  of  the  function 
(fj-r  that  are  already  encountered  in  (28): 

p7(x)  =  /3+_1  *  p-,-r(x)  where  p1-r(x)  =  *  po(x) 

(29) 


with  a  new  subscript  notation  that  makes  the  order  of  the  func¬ 
tions  explicit,  e.g.,  p  =  py.  We  then  use  this  relation  to  compute 
the  rth  derivative  of  p  for  0  <  r  <  7: 

dr Py(x)  =  dr/31^r1  *  <Py-r(x). 

Using  (15)  with  7  =  r  and  a  =  r  —  1  and  noting  that  /3-1(t)  = 
S(x),  which  is  consistent  with  (10),  we  get 

drpy(x)  =  A+py—r(x).  (30) 

This  explicit  time-domain  differentiation  formula  is  known  for 
r  integer,  but  its  extension  for  arbitrary  r  is  new  to  the  best 
of  our  knowledge.  This  result  indicates  that  the  function  <p7_r 
generates  a  basis  for  representing  the  fractional  derivatives  of 
the  scaling  function  and  of  the  wavelet;  indeed,  by  linearity,  we 
have  that  dr  {ip(x/ 2}  =  2  ^2k  g(k)drp(x  —  k).  The  question 
that  then  arises  is  the  following:  How  far  can  we  differentiate, 
or  equivalently,  how  much  B-spline  can  we  “peel  off’  before 
the  so-called  residue  <p7_,,  in  (29)  really  blows  up?  What  we 
mean  by  “blowing  up”  can  be  made  mathematically  precise  by 
requiring  that  some  Lp- norm  of  the  residue  <p7_r  remains  finite. 
This  particular  interpretation  turns  out  to  be  intimately  related 
to  the  concept  of  smoothness  in  a  general  L^-sense  and  leads  to 
the  following  smoothness  characterization  theorem. 

Theorem  8:  If  p-y(x)  =  /3+_1  *  p-y-r(x)  with  p7_,,  G  Lp, 
then  8rp-1  G  Lp ,  i.e.,  <p7  has  r  derivatives  in  the  Lp  sense. 

Proof:  First,  we  rewrite  (30)  as 

drp1(x)  =  =  E  ar(k)py-r(x  —  k ) 

fee  z 

where  the  coefficients  of  the  finite  difference  operator  are  given 
by  (12).  We  also  recall  that  ar(k )  «  0(l/kr+l),  which  implies 
that  the  sequence  ar{k)  is  absolutely  summable  (i.  e.,  ar  G  l\ 
for  r  >  0).  We  then  use  the  above  formula  in  conjunction  with 
Minkowsky’s  inequality  to  obtain  the  following  bound  for  the 
Lp- norm  of  the  derivative 

fee  z 

which  proves  the  desired  results.  □ 

Theorem  8  provides  an  explicit  link  between  the  smoothness 
properties  of  wavelets  and  the  B-spline  factorization.  It  is  also 
interesting  because  it  yields  a  general  and  coherent  approach 
to  the  concept  of  smoothness,  i.e.,  fractional  differentiability  in 
the  ip-sense.  Forp  =  2,  the  present  definition  of  smoothness  is 
equivalent  to  the  widely  used  Sobolev  regularity  [36].  Another 
interesting  case  is  p  -a  00  because  it  penalizes  the  worst  case 
(max  norm);  this  is  very  close  to  Holder  regularity,  even  though 
the  latter  is  a  measure  of  continuity  rather  than  of  differentia¬ 
bility  [19],  [37], 

Our  peeling  theory  of  smoothness  is  illustrated  in  Fig.  6  for 
the  case  of  Daubechies’  scaling  function  of  order  2.  It  is  clear 
from  the  graph  that  the  residue  <p7_r  becomes  rougher  as  a 
higher  order  B-spline  component  gets  pulled  out.  These  various 
plots  were  obtained  by  running  a  Fourier  version  [38]  of  the  cas¬ 
cade  algorithm  with  ten  levels  of  iteration  on  the  residual  factor 
of  the  refinement  filter  corresponding  to  <p7_r  in  (29).  Past  the 
limiting  case  (r  =  0.55)  [37],  <p7_r  is  no  longer  bounded,  and  it 
does  not  make  much  sense  to  attempt  to  represent  it  graphically. 
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Fig.  6.  Residual  factor  if  7_r(;u)  in  (29)  as  a  function  of  r  for  Daubechies'  scaling  function  of  order  7  =  2.  The  critical  value  =  0.55  corresponds  to  the 
limiting  case  where  pkr(x)  is  bounded  (r  derivatives  in  Loo-sense).  It  also  yields  the  Holder  exponent. 


Note  that  there  are  computational  techniques  for  determining 
this  critical  Holder  exponent  (cf.  Section  IV-B). 

To  get  some  further  insight  on  our  Lp  notion  of  smoothness, 
we  now  consider  the  example  of  the  fractional  B-splines.  The 
borderline  cases  here  are  the  B-splines  of  negative  degree;  these 
can  be  shown  to  belong  to  the  following  functional  spaces: 

3lu(x)  £  Lp.  for  p  >  —  and  0  <  u  <  1. 

~r  u 

For  the  fractional  splines,  we  have  7  =  a  +  1,  so  that  we  can 
write  (29)  as  p"  =  ff/T1  *  where  0*Tr  =  <p7_,,  is  the 
corresponding  residue.  Applying  Theorem  8,  we  conclude  that 
pa  has  up  to  s(p)  =  a  +  1/p  derivatives  in  the  Z^-sense  with 
the  strict  equality  only  being  reached  for  p  =  00.  Indeed,  we 
have  already  mentioned  that  <9Q/3“  (+)  is  bounded  ( a  derivatives 
in  Zoo),  which  is  consistent  with  the  fact  that  0"  is  a-Holder 
continuous. 

There  is  also  a  converse  part  to  Theorem  8,  albeit  only  within 
the  restricted  framework  of  wavelets. 

Theorem  9:  If  p  is  a  valid  scaling  function  such  that  ds p  £ 
Lo,  then  p(x)  =  /3+_1  *  p 0(x)  with  p0  £  Z2. 

The  proof  is  more  technical  and  is  given  in  the  Appendix  . 
Since  the  order  of  the  B-spline  factor  0s ~ 1  in  Theorem  9  is 
s,  the  above  statement  is  equivalent  to  saying  that  the  order 
of  a  valid  scaling  function  p  is  necessarily  greater  or  equal  to 
its  critical  Sobolev  exponent  smax.  In  other  words,  we  have 
that  8sp  £  Z/2  =>  7  >  s.  Vs  <  smax.  This  is  a  new  re¬ 
sult  that  extends  a  classical  theorem  in  wavelet  theory  stating 
that  (^“-continuity  (m  integer)  implies  some  minimum  approx¬ 
imation  order:  p  £  Cm  =>•  7  >  m  [5].  Note  that  our  re¬ 
sult  is  slightly  more  generous  (i.e.,  it  yields  more  order);  this 


follows  from  the  Sobolev  inequality  (for  compactly  supported 
functions)  smax  — 1/2  <  a  <  smax,  where  a  >  m  is  the  Holder 
exponent  of  p.  In  fact,  our  bound  is  sharp,  as  demonstrated  by 
the  example  below. 

The  B-splines  of  order  7  are  very  regular,  but  they  fall  short  of 
the  maximum  possible  Sobolev  smoothness  by  1/2.  Yet  we  can 
perturb  the  B-splines  to  achieve  the  maximum  possible  smooth¬ 
ness  by  taking  Q(z )  =  (l  +  £  +  z_1)  /(2  +  e)  in  (19),  where 
0  <  s  <  1  is  small  but  nonzero.  Since  we  are  dealing  with  a 
two-tap  residual,  the  Holder  (alias  L0 0)  and  Sobolev  (alias  L2) 
exponents  can  be  computed  explicitly  using  the  technique  out¬ 
lined  in  [19]  and  [37],  Specifically,  we  find  that 


a  =7  -  log 2 


2  +  2e 


'  2  In  2 

Smax  —0  —  A  l°g2  • 


+  0(e2),  forO  <  £  <  1 
(  2  +  2e  +  e2\ 

2  l  '  (2  +  e)2  ) 

+  0(e 4),  as  £  — >  0+. 


Thus,  by  letting  £  tend  to  0+,  we  are  able  to  saturate  the  above 
inequalities  such  that  7  =  smax  =  which  proves  that  the 
bound  in  Theorem  9  is  sharp.  If  we  now  push  it  a  little  further 
by  letting  e  =  0,  we  increase  the  B-spline  factor  by  one  which 
makes  us  jump  to  smax  =  7+1/2,  while  the  Holder  continuity 
remains  at  a  =  s(oo)  =  7,  but  at  the  same  time,  the  order  is  also 
increased  by  one  when  compared  with  the  previous  degenerate 


case. 


The  functions  that  saturate  the  inequality  (i.e.,  ,srnax  =  7)  are 
the  only  ones  for  which  the  residual  in  (29)  can  be  in  L2-  Indeed, 
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if  ipo  would  have  some  residual  Sobolev  smoothness,  it  would  be 
possible,  by  Theorem  9,  to  factor  out  a  larger  order  B-spline  that 
would  contradict  the  assumption  that  ip  is  of  maximum  order 
7.  For  all  other  cases,  ipo  is  not  in  Lo  and  is  typically  only  a 
distribution;  for  instance,  ipo(x)  =  S(x)  E  S'  in  the  case  of 
the  B-splines.  We  can  therefore  safely  state  that  tpo  in  (20)  has 
no  Sobolev  regularity  at  all,  which  also  means  that  there  is  no 
smoothness  possible  without  the  B-spline  part  of  the  wavelet. 

While  smoothness  implies  approximation  order,  there  is  gen¬ 
erally  no  such  implication  in  the  reverse  direction  [39].  In  other 
words,  the  presence  of  the  B-spline  factor  is  not  enough  to  guar¬ 
antee  that  the  scaling  function  is  in  Lo,  not  to  mention  its  differ¬ 
entiability.  Indeed,  one  can  conceive  of  very  irregular  wavelets 
where  the  distributional  part  ip 0  is  so  rough  that  it  counterbal¬ 
ances  the  inherent  smoothness  of  the  B-spline  factor.  This  is  the 
reason  why  smoothness  is  relatively  difficult  to  control  when 
applying  conventional  filter  design  procedures;  it  is  a  property 
that  is  usually  checked  a  posteriori. 

B.  Determining  Wavelet  Smoothness 

The  peeling  theory  provides  an  interpretation  of  wavelet 
smoothness  that  is  appealing  intuitively.  However,  we  should 
not  be  fooled  by  the  apparent  simplicity  of  the  argument. 
Wavelet  smoothness  is  and  remains  one  of  the  most  difficult 
theoretical  aspects  of  wavelet  theory.  While  Theorem  8  pro¬ 
vides  a  mathematical  criterion  for  testing  differentiability  in  the 
Lp -sense,  it  does  not  give  a  numerical  method  for  determining 
the  critical  exponents  s(p )  for  a  given  filter  H(z).  As  far  as  we 
know,  an  exact  computation  is  only  possible  for  determining  the 
Sobolev  index  (i.e.,  i^-smoothness)  of  compactly  supported 
basis  functions.  The  preferred  method  is  based  on  the  determi¬ 
nation  of  the  spectral  radius  of  the  so-called  transition  operator; 
in  practice,  this  amounts  to  computing  the  maximum  eigenvalue 
of  a  reduced  transition  matrix  associated  with  the  residual  filter 
Q(z )  [6],  [20].  Various  techniques  have  also  been  proposed 
to  estimate  the  Holder  exponent  (alias  L^-smoothness)  [37], 
[19],  These  methods  are  not  exact  anymore  (except  when  ip  is 
symmetrical)  but  can  provide  tight  upper  and  lower  bounds. 

Determining  fractional  orders  of  smoothness  in  norms 
other  than  L2  is  more  challenging  mathematically,  especially 
since  there  is  no  single  universal  definition  of  fractional 
differentiation  in  the  time  domain.  Mathematicians  have  turned 
the  difficulty  by  testing  the  appartenance  of  the  function  to 
some  “smoothness”  spaces.  Villemoes  [20]  proposed  to  study 
wavelet  smoothness  using  Besov  spaces,  which  leads  to  the 
determination  of  a  critical  Besov  exponent  that  is  qualitatively 
similar  to  the  notion  of  /,,,-smoothness  that  we  have  considered 
here.  His  approach  is  based  on  the  fact  that  wavelets  provide 
unconditional  bases  for  Besov  spaces  [35].  More  recently, 
Micchelli  and  Sauer  have  proposed  to  extend  the  Holder 
notion  of  continuity  to  Lp- space  by  introducing  what  they 
call  generalized  Lipschitz  spaces  [40].  Their  formulation  is 
general  but  also  rather  involved — about  90  pages  of  dense 
mathematics.  By  contrast,  our  approach  to  measuring  wavelet 
smoothness  is  much  closer  to  a  classical  Lp- Sobolev  analysis, 
except  that  the  traditional  method  for  p  f  2  is  restricted  to 
integer  orders  of  differentiation.  What  makes  the  Sobolev 
technique  applicable  here  is  our  explicit  time-domain  formula 


(30)  for  fractional  differentiation,  which  is  specific  to  scaling 
(or  refinable)  functions. 


C.  Biorthogonal  Wavelet  Basis  for  Fractional  Differentiation 

The  stability  of  wavelets  with  respect  to  differentiation  is  a 
prerequisite  for  using  them  to  characterize  smoothness  spaces, 
in  particular,  for  proving  that  they  provide  unconditional  bases 
for  Sobolev,  Holder,  and  Besov  spaces  [35].  Although  these 
functional  aspects  of  wavelet  theory  are  beyond  the  scope  of 
this  paper,  we  want  to  make  an  interesting  connection  by  iden¬ 
tifying  and  characterizing  the  biorthogonal  wavelet  basis  asso¬ 
ciated  with  the  fractional  differentiation  operator. 

The  idea  is  simple  and  is  based  on  the  following  manipulation 
of  the  wavelet  biorthogonality  relation: 

{4>(x),  if(x  —  k ))  =  ( dZsf>(x ),  dsip(x  —  k ))  =  6k  (31) 

where  dZs  < — -  denotes  the  anticausal  fractional  in¬ 

tegration  operator;  it  is  the  adjoint  of  d~s  < — »  ( jco)~s ,  which 
is  the  inverse  of  the  fractional  differentiation  operator  ds.  Based 
on  the  results  in  Section  IV- A,  we  obtain  the  explicit  form  of  the 
“derivative”  wavelet 


4>^s\x)  :=  dsip(x )  =  2  gs(k)<p1-a{2x  -  k) 

fee  z 

where  </?7_s  is  defined  by  (28),  andGs(z)  =  2S(1  —  z~1)sG(z). 
We  now  close  the  loop  by  showing  that  the  biorthogonal  partner2 
of  ip^-s  in  (29),  that  is,  the  scaling  function 

=  /3i_1  *  <£7 

provides  the  complementary  basis  for  expanding  DZf'f.  which 
is  the  s-order  fractional  integral  of  the  analysis  wavelet.  This 
manipulation  is  performed  in  the  Fourier  domain  starting  from 
(26) 


d_sf)(x ) 


*0-3  (1  ~^/2Y 


(-¥) 

x  (l  -  e-7'^2)7  A  . 


The  key  here  is  the  result  of  Proposition  6,  which  allows  us 
to  extract  the  relevant  B-spline  factor  as  long  as  s  <  7.  After 
inverse  Fourier  transformation,  we  end  up  with  a  stable,  explicit 
representation 


$(  s\x)  =  d_sip(x)  =  2 ^ g-s(k)ipj+r(2x  —  k) 
he  z 


where  G-S(z)  =  2  S(1  —  z)’1  8P{z).  Indeed,  we  can  invoke 
Young’s  convolution  inequality  (cf.  [42]) 


°-j+s  ||  r 


<  H/?!-1 


L 1 


"'Til  Lv 


which  proves  that  (p^  E  Lp  =>  f>-y+s  €E  Lp  (since  fts_  1  E 
L\,  s  >  0).  Hence,  we  have  established  that  the  new  wavelet 


2This  is  a  suggestive  denomination  that  we  are  happy  to  borrow  from 
Vaidyanathan  [41]. 
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Sobolev  smoothness 

Unconditional  bases 

5scp  e  Lj 

1 

=> 

for  Besov  spaces 

L  AND 


ft 


B-spline  factorization: 

Approximation  order: 

Multi-scale  differentiator 

<p=fr*cp0 

<=> 

. ^ 

<=> 

\j/(a))oc(-jco)Y,  (1)  — >  0 

ft 


ft 


general  case:  n  <  y  <  n  + 1 

compact  support:  y=n  +  l  (Strang-Fix,  1971) 


Polynomial  reproduction 
degree:  «  =  fy  — 1~[ 

ft 

Vanishing  moments: 
^xp^f{x)dx  =  0,  p  =  0,...,n 


Fig.  7.  Summary  of  wavelet  theory:  main  properties  and  equivalences. 


pair  (- t/>(s()  generates  a  biorthogonal  basis  that  is  in  Lp 
provided  that  8s  ip  G  Lp  and  Cp  G  Lp.  Note  that  it  is  essential 
here  to  work  with  wavelets  because  the  fractional  integral  of  a 
scaling  function  is  generally  not  in  Lp  (since  <^(0)  ^  0). 

Finally,  by  using  the  rescaling  property  8s  {f(x/a)}  = 
a~sds  {/}  (x/a),  we  obtain  the  differentiation  formulas  for 
the  basis  functions: 

ds  {ipi,k}  =  2 and  d-S  {$;,&}  =  2  (32) 

which  are  the  “eigen-relations”  to  which  Meyer  was  alluding, 
even  though  he  did  not  write  them  down  explicitly.  Of  course, 
the  qualifying  statement  is  not  rigorously  correct — the  im¬ 
portant  point  is  that  there  are  basis  functions  with  the  same 
wavelet  structure  on  both  sides  of  the  identity.  The  practical 
relevance  of  these  “differential”  wavelets  is  that  they  give  us  a 
direct  way  of  gauging  the  fractional  derivative  of  a  signal  based 
on  its  wavelet  coefficients  in  the  original  basis.  Specifically, 
we  have  that  </,^,fc)  =  (ds/,dls^,fc)  =  2 zs (ds  f,^~s)), 
which  implies  that  the  original  wavelet  coefficients  divided  by 
2IS  are  the  coefficients  of  8s  f  in  the  modified  wavelet  basis 
!  l  )  •  This  provides  a  strong  hint  as  to  why  the  Sobolev 
norm  of  a  signal  can  be  measured  from  the  Z  2 -norm  of  its 
rescaled  wavelet  coefficients.  The  Besov  case  is  analogous 
with  Zp-norms  being  used  instead;  the  argument  there  is  more 
involved  and  relies  on  some  Riesz-type  ip-norm  equivalences 
[35],  In  addition,  note  that  the  wavelets  that  have  just  been 
specified  are  fractional  ones,  which,  in  themseves,  may  serve 
as  an  a  posteriori  justification  for  our  extended  formulation. 


V.  Conclusion 

In  this  paper,  we  proposed  a  new  spline-based  interpretation 
and  derivation  of  some  fundamental  wavelet  properties.  Our 
argumentation  entirely  rests  on  the  representation  of  a  scaling 
function  as  the  convolution  between  a  B-spline  and  a  distribu¬ 
tion:  =  (P~l  *  ipo.  The  B-spline  constitutes  the  regular  part 

of  the  wavelet,  and  its  presence  is  necessary  (and  sufficient)  for 
the  transform  to  have  approximation  order  7  >  0  (cf.  Theorem 
2).  In  particular,  this  implies  that  one  cannot  build  a  multireso¬ 
lution  wavelet  basis  of  L2  without  any  B-spline  factor.  We  have 
also  shown  that  the  key  properties  of  B-splines  (smoothness, 
differentiability,  reproduction  of  polynomial,  etc.)  get  carried 
over  to  the  scaling  function  in  an  essentially  mechanical  fashion 
through  the  convolution  relation.  These  results  and  the  various 
equivalences  that  have  been  established  are  summarized  in 
Fig.  7.  The  advantage  of  approaching  the  problem  from  this  end 
is  that  these  basic  properties  are  relatively  easy  to  understand 
for  the  B-splines  because  of  their  simple  analytical  form  in 
both  the  time  and  frequency  domain.  This  particular  point  of 
view  leads  to  the  conclusion  that  the  primary  mathematical 
wavelet  properties — order  of  approximation,  reproduction 
of  polynomials,  vanishing  moments,  and  smoothness  of  the 
basis  function — are  entirely  due  to  the  B-spline  component. 
Another  interesting  property  that  has  been  explained  from  the 
B-spline  perspective  is  the  stability  of  the  wavelet  basis  with 
respect  to  fractional  differentiation.  Our  key  observation  is 
that  there  is  a  direct  correspondence  between  the  process  of 
moving  a  B-spline  factor  from  one  side  to  another  in  a  pair  of 
biorthogonal  scaling  functions  and  the  exchange  of  fractional 
integrals/derivatives  on  their  wavelet  counterparts. 
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If  these  fundamental  properties  are  to  be  attributed  to  the 
B-spline  part  exclusively,  then  what  is  the  purpose  of  the 
distribution  po  corresponding  to  the  residual  filter  ()(z)l  This 
part  is  essential  for  imposing  additional  properties — such  as 
orthonormality — and,  more  important,  to  balance  the  local¬ 
ization  properties  (size)  of  the  analysis  and  synthesis  filters  in 
Fig.  1.  B-splines  are  optimal  in  terms  of  size  and  smoothness, 
but  they  are  not  orthogonal.  To  construct  a  pure  spline  wavelet 
transform,  one  needs  to  orthogonalize  the  B-splines  [43], 
[44]  or  to  specify  a  dual  pair  of  spline  functions  [45]-[47]; 
in  both  cases,  this  is  equivalent  to  selecting  a  distributional 
part  of  the  form  ipo(x)  =  XXezF^’)*^  —  &)>  where  p(k)  is 
an  appropriate  digital  filter.  However,  the  implementation  of 
these  (semi-orthogonal)  spline  transforms  requires  HR  filters, 
which  is  often  considered  a  handicap.  Early  on,  Daubechies 
and  others  [8],  [9]  have  shown  that  the  only  way  to  construct 
wavelets  and  basis  functions  that  are  compactly  supported 
on  both  sides  (analysis  and  synthesis)  is  by  careful  selection 
of  the  factor  (){z),  eventually  moving  it  to  the  analysis  side, 
which  yields  biorthogonal  spline  wavelets  [8].  This  explains 
why  all  popular  wavelet  families  (Daubechies,  Coiflets, 
Cohen-Daubechies-Feauveau,  9/7,  etc.)  include  nontrivial 
distributional  factors  ipo(x)  and/or  (po(x ),  which  we  like  to 
view  as  the  price  to  pay  for  the  additional  but  more  classical 
filter  design  constraints  (e.g.,  FIR  filterbank  and  orthogonality). 

Appendix 

Proofs 


A.  Proof  of  Theorem  7 

If  p>  is  of  order  7,  then  G(e^u)  =  0(|w|7)  by  Proposition  6. 
This,  together  with  the  wavelet  scaling  (26)  and  the  bounded¬ 
ness  of  ip(u>)  (consequence  of  the  Riesz  condition),  implies  that 

i>(u)  =  0(M7).  ^ 

Conversely,  if  tp(u>)  =  0(|w|7),  then  G{e^)ip{u)  = 

0(|w|7).  Now,  using  the  assumption  that  (p(u>)  >  0  in  a 
neighborhood  of  ui  =  0  (consequence  of  the  continuity  of  ip(u>) 
at  u>  =  0  since  cp(0)  =  1)  and  the  stability  of  the  filter  G(z), 
we  can  claim  that  G(e ■7“)  =  0(|w|7).  This  is  also  equivalent  to 
(23),  where  P(z )  is  stable,  as  seen  in  the  proof  of  Proposition 
6.  Thus,  from  Proposition  6,  this  is  equivalent  to  the  7-order 
property  of  ip.  □ 


B.  Proof  of  Theorem  9 

We  require  one  (very  mild)  technical  assumption,  namely, 
that  the  refinement  filter  H{e^)  be  continuous  at  uj  =  0.  To 
prove  Theorem  9,  we  define 


=  E 

ke  z 


u>  +  2kir 
1  -  (->~ 


2s 

1 1 p(u}  +  2/u’7t)|2. 


(33) 


This  is  a  true  function  (possibly  taking  the  value  +00  for  some 
w)  because  it  is  a  sum  of  positive  functions.  We  observe  that 
because  of  (6),  B(cu)  satisfies  the  scaling  relation 

B(2lj)  =  \Q0(e^)\2B(cu)  +  \Q0(-e^)\2B(uj  +  7 r)  (34) 


where  we  have  defined  Qq(z )  =  2sH(z)/(l  +  z  1)s.  In  par¬ 
ticular,  this  implies  that 

\Q0(ei“)\2B(iu)  <  B(2co). 


We  now  prove  that  B(cu)  is  integrable  over  [— tt,  7t]. 

First,  we  observe  that  B(u>)  is  integrable  over  any  closed 
subset  I  of  [— 7T,  7r]  that  does  not  include  0.  This  is  because 
|1  —  e~^\~2s  is  bounded  over  /,  which  implies  that  B(uj)  is 
bounded  by  lw  +  2,k'K\2s\p{LO  +  2kir)\2  up  to  a  multi¬ 

plicative  constant.  Now,  because  |w|s|</>(w)|  is  in  L2 

\u>  +  2ki:\2s\Lp(uj  +  2A:7r)  |2 

fee  z 

is  in  Li([— 7T,7r])  and,  thus,  is  in  Li(I). 

Second,  the  continuity  of  |iT(e3“)| — and  of  |l+e_JW  |s — at  0 
implies  that  for  any  s  >  0 — here,  we  choose  e  <  1/2 — we  can 
find  0  <  ujo  <  7r  such  that  for  all  |w|  <  u>o-  IQoCe--7^)!  ^  2-£. 
Consequently 

V|w|  <  wq,  2 ~2sB(uj)  <  B(2lo). 


Then,  we  define  bn  =  JT  B(u ;)  dw,  where  In  =  2_"wo  X 
([—1,  —1/2]  U  [1/2, 1]).  bn  are  bounded  quantities  because  the 
In' s  are  closed  subsets  of  [— 7r,  7t]  \  {0}.  We  thus  have  < 
2 2s~1bn,  i.e.,  b„  <  2 2ne~nbo  or,  by  summing  up  all  the  contri¬ 
butions 


/|u>M)j€[l/2",l] 


B(uj)  du!  < 


bp 

1  -  22s~1 


which  is  bounded,  independently  of  n.  Fetting  n  tend  to  infinity, 
we  conclude  that  B{lo)  is  integrable  in  [—  ojq,  wo]  (Fatou’s  the¬ 
orem),  in  particular,  in  the  neighborhood  of  0.  Thus,  B(lo)  is 
fully  integrable  in  [— 7r,  7r]. 

It  is  now  a  simple  matter  to  state  that  the  function  <po{uf)  = 
tpfjj) / j3s~l {uj)  is  in  Lo-  This  is  because 


|</>0(w)|2  dw=E 

A-ez  • 


/7T 

B(lu)  duKoo 


where  B(u>)  is  the  same  as  defined  by  (33)  and  has  just  been 
shown  to  be  integrable  in  [  —  7r,  7t]  .  □ 
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